InfoGrid: providing information integration for knowledge discovery

نویسندگان

  • Nikolaos Giannadakis
  • Anthony Rowe
  • Moustafa Ghanem
  • Yike Guo
چکیده

Many scientific experiments produce large amounts of data using high-throughput devices. In order to analyse this type of data Knowledge Discovery systems are required. However, generic laboratory systems do not provide any contextual information about the system that is being studied. In these situations, Knowledge Discovery can be aided and validated by the use of Information integration tools. In this paper, we introduce InfoGrid, a data integration, middleware engine, designed to operate under a Grid framework. It focuses on providing information access services and offers all users a query system which is able to retain the familiarity with their specific scientific applications while being diverse, flexible and open at the same time. The assumption there is that defining a common language for all queries is not desirable. Using this design, we show how the InfoGrid architecture can be used to provide contextual features for a data table to be used for analysis (i.e. the Annotation Problem). We also show how it can be used to find relevant background knowledge for a user (i.e. the Information Comprehension problem). Both of these issues are repeatedly found in Knowledge Discovery tasks, which we illustrate with a worked example.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Information Integration for E-science

In this paper, we introduce InfoGrid, a data integration, middleware engine, designed to operate under a Grid framework. InfoGrid provides dynamic information access and integration services to a variety of semi-structured web-based data sources. It also provides users with a query system that retains their familiarity with their specific scientific applications, while allowing them to dynamica...

متن کامل

InfoGrid: Information Resource Integration

Grid computing [3] constitutes the amalgamation of a drive towards the standardization of existing technologies that enable the collaboration of scientists overcoming restrictions of location, distance and compatibility. The aim is to exploit the full potential of resources, computational or informational. We aim to study how a network of distributed and heterogeneous grid resources could attai...

متن کامل

Designing an Ontology for Knowledge Discovery in Iran’s Vaccine

Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Sci.

دوره 155  شماره 

صفحات  -

تاریخ انتشار 2003